Learning a Reversi Board Evaluator with Minimax

نویسنده

  • Kevin T. Engel
چکیده

A board position evaluator is a crucial component for strong computer play in many games such as checkers, chess, and Reversi. The board evaluator is typically trained using pre-existing game data, an approach which is generally non-optimal, especially in the early stages of a game. Instead, we propose a new method which relies on Minimax search to train a series of models backwards from the endgame, propagating information about endgame scoring backwards to earlier positions. In the limit of perfect models, our method is optimal, converging to the full Minimax board evaluation. We test our method experimentally by training a simple Reversi model using both our Minimax method and high-level tournament game data. When played against each other, our model outperforms the game data model, averaging 5 more stones per game, and winning approximately 60% of the matches.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using an Evaluator Fixed Structure Learning Automata in Sampling of Social Networks

Social networks are streaming, diverse and include a wide range of edges so that continuously evolves over time and formed by the activities among users (such as tweets, emails, etc.), where each activity among its users, adds an edge to the network graph. Despite their popularities, the dynamicity and large size of most social networks make it difficult or impossible to study the entire networ...

متن کامل

Rion Snow

Here we present the algorithms and extensions that produced our successful Othello-playing program, which finished among the top five programs of this year’s Othello tournament. Specifically, we discuss a variety of search algorithms and extensions, as well as our algorithms for adaptive timemanagement, our efficient ‘bit board’ data structure, our features comprising our evaluation function, o...

متن کامل

Machine Learning of Othello Heuristics

The machine learning algorithm of [3] is applied to the problem of learning which heuristics to apply when playing the board game Othello. The problem is large, for there are 46,875 heuristics considered. The results are respectable; the Learner is able to beat a practiced human player approximately fifty percent of the time. Suggestions for improvement are included.

متن کامل

Parallel Minimax Tree Searching on GPU

The paper describes results of minimax tree searching algorithm implemented within CUDA platform. The problem regards move choice strategy in the game of Reversi. The parallelization scheme and performance aspects are discussed, focusing mainly on warp divergence problem and data transfer size. Moreover, a method of minimizing warp divergence and performance degradation is described. The paper ...

متن کامل

An Intelligent Othello Player Combining Machine Learning and Game Specific Heuristics

In this paper we present an intelligent Othello game player that combines game-specific heuristics with machine learning techniques for move selection. Five game specific heuristics have been proposed; some of which can be generalized to fit other games. For machine learning techniques, the normal Minimax algorithm along with a custom variation is used as a base. Genetic algorithms and neural n...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015